CIFICATION 

Electronic Version 1.2.8] 
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Peptides, Proteins, and Similar Structures 



CROSS-REFERENCE TO RELATED 
APPLICATIONS 
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Rates of Peptides, Proteins, Hormones, and Peptide-Like, Protein-Like and Hormone 
Like Molecules. 



STATEMENT REGARDING FEDERALLY 
SPONSORED RESEARCH OR DEVELOPMENT 

Not Applicable 



INCORPORATION-BY-REFERENCE OF MATERIAL 
SUMBMITTED ON A COMPACT DISC 

Enclosed CD of book: Molecular Clocks: Deamidation of Asparaginyl and Glutaminyl 
Residues in Peptides and Proteins., Robinson, N.E. and Robinson, A.B., Althouse Press, 
Cave Junction, OR, ISBN 1-59087-250-9. This 448 page book contains a complete 
review of the subject, including over 1785 references to the research literature, 86 
Figures and 16 Tables. The inventions described in this patent are placed in context by 
this book. 



ACKGROUND OF THE INVENTION 



[0001] The deamidation of peptides and proteins as well as molecules related to 

peptides and proteins is a well known phenomenon. In this reaction, Asn or Gin residues 
are gradually changed into Asp and Glu residues and their isomers respectively. The rate 
of this reaction is dependent on the primary sequence, three-dimensional structure, pH, 
temperature, buffer type, ionic strength and other solution properties. The half-time 
varies from less than 1 day to more than a century. The reaction introduces a negative 
charge into the molecule. In addition, the isomerization products R-Asp and li-Glu as 
well D-isomerized forms and chain cleavage also accompany the reaction. 

[0002] The stability of Asn and Gin in pharmaceutical and other types of commercial 

preparations is a major field of study. Efforts have been made to discover formulation 
conditions that will minimize the rate of deamidation of amides in these preparations. 
There is also commercial potential in induced or controlled deamidation as an active 
aspect of the product. 

BRIEF SUMMARY OF THE INVENTION 

For the purposes of this work the definition of terms is as follows: Asn - 
Asparaginyl residue in a peptide or protein, Gin - Glutaminyl residue in a peptide or 
protein. 

[0003] The inventions described here pertain to the engineering of peptides, 

hormones, and proteins as well as peptide-like, hormone-like and protein-like 
molecules. 

[0004] It is well known that for peptide sequences of the type AsnXxx and GlnXxx, 

where Xxx is any natural or unnatural amino acid, the rate of deamidation of either Asn 
or Gin depends very strongly on the identity of Xxx. These results are applicable to 
peptides, proteins and hormones as well as any amide-containing molecule with similar 
structure. It is also applicable to isomerization of AspXxx and GluXxx sequences. 

[0005] I have done extensive work showing the quantitative sequence dependence 

of these reactions. I have also invented a method for applying this sequence dependence 
to proteins, peptides, and other similar molecules, in conjunction with their three- 
dimensional structures. 

[0006] These inventions allow the prediction of deamidation rates of amides as a 

function of primary and three-dimensional structure, if the three-dimensional 



1 



structures is known. They also provide quantitative information about the parameters 
that make up these rates and show which structural elements are important for each 
rate. 

[0007] These inventions can be used to modify predictably structural elements to 

provide stability or controlled instability in amides or acids in pharmaceutical and other 
types of commercial preparations. Specifically there are three major types of 
modifications that can be made that will change the rate by amounts that can be 
quantitatively or qualitatively determined from these inventions. Asp and Glu residues 
also undergo reactions controlled in this way. 

[0008] 1 . Modification of the residue or residue-like structure to the carboxyl-side 

or amino-side of the amides or acids. This can be done by substitution of a different 
natural or non-natural amino acid side chain. 

[0009] 2. Exchange of Asn for Gin or Gin for Asn. Gin deamidation and probably Glu 

isomerization is substantially slower by a quantitative amount. 

[001 0] 3. Modification of other surrounding structural elements that affect the rate 

of the reaction as determined by my current three-dimensional calculation procedure or 
a similar procedure resulting from improvements in the current method. 

[001 1] These inventions allow the engineering of molecules with specific amide 

structures that will deamidate at specified rates. These procedures can be used to 
design stable and unstable forms for pharmaceutical, industrial, and other products. 
This can be used to increase the shelf-life of such products through minor 
modifications, prevent or at least slow down the gradual formation of impurities in 
preparations with these modifications, and may make possible as a result of minor 
modifications the use of products that would otherwise be too unstable for practical 
purposes. The engineering of products with unstable amides that are programmed to 
deamidate at specific rates is also a valuable application of this procedure. 



BRIEF DESCRIPTION OF THE SEVERAL VIEWS 
OF THE DRAWING(S). 

Not Applicable 



DETAILED DESCRIPTION OF THE INVENTION 



[0012] General Method: 

[001 3] It was known before the invention of the method described here, that the 

sequence and structure around an amide has a large effect on the deamidation rate. 
Unknown, however, was the necessary quantitative information that would allow 
engineering of stable amides or amides with predetermined rates based on modification 
of the sequence and three-dimensional structure. 

[0014] Experiments which I carried out showed quantitatively the effects of 

sequence dependent deamidation. One of the discoveries made was that the sequence 
dependence of deamidation is much richer and covers a wider range than previously 
thought. In 37°C, pH 7.4, 0.1 5 M Tris buffer, the combination of XxxAsnYyy and 
XxxGlnYyy sequences where Xxx and Yyy are any of the naturally occuring amino acids 
covers a range from less than 1 day to over 1 5,000 days with the entire range in 
between available. 

[001 5] In addition to the sequence dependent work, I have also invented methods 

that allow application of this sequence dependent data to three-dimensional protein 
structures to permit the prediction of protein deamidation rates. This method is 
applicable to any peptide type structures including peptides, hormones, and proteins 
and peptide-like, hormone-like, and protein-like molecules, as well as similar 
structures that deamidate in the same way. 

[001 6] This prediction procedure is based on identifying structural elements in a 

protein or similar molecule that contribute to the rate in known quantitative ways. These 
include, but are not limited to, hydrogen bonds of various types, disulfide bonds, alpha- 
helices, and beta-sheets. The effect of each structure depends on a variety of 
quantitative factors. 

[001 7] The invention of these prediction techniques had never been attempted 

before. Not only do they allow prediction of deamidation rates to very high reliability, 
but the calculation shows what structural features are responsible for each particular 
rate and what changes would be necessary to modify the rate in a quantitative manner. 



[0018] 



Sequence Dependence: 



[001 9] Tables 1 and 2 show the sequence dependence of deamidation measured 

using natural amino acid variations in pentapeptides. Non-natural variations provide an 
even greater range of sequences to choose from. 

[0020] Table 1 describes the sequence dependence of Asn sequences. It is based on 

pentapeptide rates measure in 37°C, pH 7.4, 0.1 5 M Tris buffer. The applicability of a 
pentapeptide model to sequence dependence was verified in a separate set of 
experiments. All values listed in this table are experimental except for the four values in 
boxes, which were estimated from the rest of the data. 
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[0021] Table 2 describes the sequence dependence of Gin peptides. It is also based 

on pentapeptide rates measure in 37°C, pH 7.4, 0.1 5 M Tris buffer. In this case, the 52 
values shown in bold were measured, and the rest of the values were derived from 
surface fitting. 

[0021b] Tables 1 and 2 were published in: Robinson, N.E., Robinson, Z.W., Robinson, 

B.R., Robinson, A.L, Robinson, J.A., Robinson, M.L, and Robinson, A.B., (May 2004) 
Structure-dependent nonenzymatic deamidation of glutaminyl and asparaginyl 
pentapeptides,/ Peptide Res., 63, 426-436. 
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[0022] Deamidation rates are affected by a wide variety of parameters, including, 

pH, Temperature, Ionic Strength, and Buffer Ions. These rates are measured under pH 
and Temperature conditions that are applicable to biological systems. The buffer type 
and concentration were chosen to minimize ion affects to the extent possible given the 
experimental limitations. Modification of these conditions will change the rates in Tables 
1 and 2. As long as the conditions are not taken to extremes (i.e. high temperature, or 
strongly acidic or basic conditions) the qualitative sequence dependence should remain 
the same and the rates reported here can be used with necessary adjustments. 

[0023] It is also clear that direct hydrolysis of Gin and Asn take place in addition to 

the regular sequence dependent mechanism. This hydrolysis is sequence dependent as 
well, but an average value of about an 8000 day half-time can be taken as a rough 
approximation based on this and other data measured at the same time. This does not 
effect the Asn rates significantly, but is responsible for the leveling off of the Gin rates 
at around this level. This hydrolysis is also effected by the reaction conditions. 

[0024] The sequence dependence apparent in Tables 1 and 2 is of great value in 

engineering stable amides, unstable amides, or amides with particularly desired rates. 
Isomerization of acid residues will follow a very similar sequence dependence, offset by 
a determinable amount. 

[0025] Gin vs. Asn Deamidation: 

[0026] It is apparent from the data shown in Tables 1 and 2 that the deamidation 

rates of Asn and Gin cover markedly different ranges. One of the discoveries in these 
experiments was that their sequence dependencies are complementary. Asn sequences 
cover the range from about 1 day to 450 days. Gin picks up at 560 days and carries 
these rates out to tens of thousands of days. 

[0027] This opens up a new possibility for engineering of amide rates. It is possible 

to switch half-time ranges simply by substituting Asn for Gin or Gin for Asn depending 
on the desired effect. In many cases where it is desirable to introduce or leave in place 
an amide, the difference of one CH 2 group in chain length may not be critical. 

[0028] Moreover, the fact that this range switching can be done raises another 

possibility. Other modifications of Gin and Asn may lie in different ranges. Thus the 
substitution of unnatural amide side-chains is also a valuable procedure. 

[0029] Three-Dimensional Effects of Deamidation: 

[0030] The invention of the three-dimensional prediction method for deamidation 

rates has been developed in two phases. The first of these was the invention of a 
technique for determining deamidation rates in proteins based on manually counting 
the number of each type of structure that can affect the rate. Each of these effects is 



then summed with special coefficients to produce the correct rate. The procedure was 
calibrated on known relative deamidation rates and then found to be quite accurate in 
predicting absolute rates. 

[0031] Secondly, the procedure was adapted to an automated method by means of 

an extensive C++ program. Some modifications were made when this was done, but the 
basic procedure remained the same. 

[0032] I am not attempting to patent this C++ program. There are many ways to 

write such programs and the current version is protected by copyright. What is being 
patented is the method used to write it which is based on the manual procedure and 
minor modifications and improvements that are particularly adapted to computerized 
calculation and include many conceptual innovations. 

[0033] It will be obvious to anyone who studies and understands these methods that 

there are variations in the procedure and even some improvements that could be made 
which would yield similar results. Any such modifications are understood to be products 
of this invention and come under the scope of this patent. 

[0034] The deamidation coefficient, C D , for and amide is defined as: C D = (0.01 )(t- 

pl/2 )( e f(Cm. CSn.Sn)) 

[0035] Here ti /2 is the pentapeptide primary structure half life, C m is a structure 

proportionality factor, C Sn is the 3D structure coefficient for the nth structure 
observation, S n is that observation, and f(C m , C Sn , S n ) = C m [(C S i)(Si) + (CS2KS2) + (CS3XS3) 
- (Cs4, 5 )(S4)/(S 5 ) + (CseXSe) + (C S7 )(S 7 ) + (C S8 )(S 8 ) + (C S9 )(S 9 ) + (C S io)(l - S10) + (C S n)(5 - 
Sn) + (Csi 2 )(5 - S12)]. The structure observations, S n , were selected as those most likely 
to impede deamidations, including hydrogen bonds, a-helices, f$-sheets, and peptide 
inflexibilities. The functional form of C D assumes that each of these structural factors is 
added to the reaction activation energy. The observed S n were: 

[0036] For Asn in an <x-helical region: 

[0037] Si = distance in residues inside the a-helix from the NH 2 end, where Si = 

1 designates the end residue in the helix, 2 is the second residue, and 3 is the third. If 
the position is 4 or greater, Si = 0. 

[0038] S 2 = distance in residues inside the a-helix from the COOH end, where Si 

= 1 designates the end residue in the helix, 2 is the second residue, and 3 is the third. If 
the position is 4 or greater or Si * 0, then S 2 = 0. 

[0039] S3 = 1 if Asn is designated as completely inside the a-helix, because it is 

4 or more residues from both ends. If the Asn is completely inside, S 3 = 1 , Si = 0, and S- 

2 = 0. If Si * 0 or S 2 * 0, then S 3 - 0. 



[0040] For flexibility of a loop including Asn between two adjacent antiparallel 
Psheets: 

[0041] S 4 = number of residues in the loop. 

[0042] S 5 = number of hydrogen bonds in the loop. S 5 > 1 by definition. 

[0043] For hydrogen bonds: 

[0044] S 6 = the number of hydrogen bonds to the Asn side chain C=0 group. 
Acceptable values are 0, 1 , and 2. 

[0045] S7 = the number of hydrogen bonds to the Asn side chain NH 2 group. 

Acceptable values are 0, 1 , and 2. 

< 

[0046] Ss = the number of hydrogen bonds to the backbone nitrogen atom in the 

peptide bond on the COOH side of Asn. Hydrogen bonds counted in Se or S 7 are not 
included. Acceptable values are 0 and 1 . This nitrogen atom is used in the five- 
membered succinimide ring. 

[0047] S 9 = additional hydrogen bonds, not included in S 6 , S 7 , and S 8 , that would 

need to be broken to form the succinimide ring. 

[0048] For Asn situated so that no a-helix, fj-sheet, or disulfide bridge structure is 

between the Asn and the end of the peptide chain: 

[0049] S10 = 1 if the number of residues between the Asn and the nearest such 

structure is 3 or more. If the number of intervening residues is 2, 1 , or 0, or Asn not 
between structure and chain end, then S10 = 0. 

[0050] If the Asn lies near to any cx-helix, fJ-sheet, or disulfide bridge structures: 

[0051] Sn = the number of residues between the Asn and the structure on the NH 2 

side, up to a maximum of 5. Values of 0, 1 , 2, 3, 4, and 5 are acceptable. 

[0052] S12 = the number of residues between the Asn and the structure on the 

COOH side, up to a maximum of 5. Values of 0, 1 , 2, 3, 4, and 5 are acceptable. 

[0053] Hydrogen bonds selected by the Swiss Protein Data Bank (PDB) viewer were 

accepted if the bond length was 3.3 A or less and there was room in the structure to 
accommodate the van der Waals radius of the hydrogen. In the computerized procedure 
this bond length was optimized at 4.1 A, and the bond angles and number of bonds per 
atom were adjusted to physically correct and optimized values. The Swiss PDB viewer, 
according to the customary criteria, selected a-helices and p-sheets. All primary 



structure ti/2 values were those published 6 , except for Asn with carboxyl-side Pro, Asn, 
or Gin and N-glycosylated Asn. Estimated values were used for any sequence for which 
the primary sequence rate was not known. 

[0054] Coefficients Used in Equation: 

[0055] C D values ("Coefficient of Deamidation") were optimized by using various 

values for C m and C Sn to maximize the value of the deamidation resolving power, D P , as 
described in the calibration procedure section. The optimized values were C m = 0.48, C- 
si = 1 .0, C S2 = 2.5, C S3 = 1 0.0, Cs4, 5 = 0.5, C S6 = 1 .0, C S7 = 1 -0, C S8 = 3.0, C S9 = 2.0, Csio 
= 2.0, Csn = 0.2, and Csiz = 0.7. 

[0056] As an example, the f$-LysAsn(l 45)His sequence of hemoglobin is not in an 

(x-helix or in a loop between two psheets, so Si through S 4 = 0, S 5 = 1 . There is one 
hydrogen bond to the amide side chain nitrogen and one other to be broken to form the 
imide, but there are none to the amide carboxyl or the backbone nitrogen, so S 6 = 0, S 7 
= 1 , Ss = 0, and Sg = 1 • This Asn is near the carboxyl end of the chain and one residue 
from an <x-helix on the amino side, so Sio = 0, Sn = 1 , and S12 = 5. The 
GlyLysAsnHisCly half life 6 is 10.5 days. Therefore, C D = (0.01)(10.5)e- 
(o.48)i(i )(i)+(2)(o+(2)(io)+(o.2)(4)] - (Q. 1 05)e<°- 48 >< 5 - 8 > = (0. 1 05)(1 6. 1 84) = 1.70. 

[0057] C D is multiplied by 100 to give the predicted Tris deamidation half-time in 

days for the amide. 

[0058] Results for Asn are greater than 95% correct in predicting the fastest amide 

in a protein. It is also applicable to Gin. 

[0059] It is also likely that isomerization of Asp and Glu can be modeled with the 

same procedure. Primary rate data on Asp and Glu isomerization or a correction factor 
to be applied to the Asn and Gin data is needed in order to do this. 

[0060] Conclusions: 

[0061] Three different types of modifications that can be used in the engineering of 

deamidation and/or isomerization rates of amides and possibly acids have been 
invented. These are: 

[0062] 1 . Modification of the residues or residue-like structures on either side of 

the amide - principally the one on the right (carboxyl side). 

[0063] 2. Modification of the amide - specifically Asn to Gin or Gin to Asn, but other 

types of modification can also be used, especially in the case of structures that are 
similar, but not a perfect match to those found in peptides, hormones, and proteins. 



[0064] 3. Modification of the three-dimensional environment around the amide. The 

necessary modifications can be determined from the three-dimensional deamidation 
prediction method. Each of the S parameters describes a quantitative addition to the 
reaction activation energy. Removal or addition of one or more of these elements will 
change the rate accordingly. 

[0065] At least two types of deamidation are present. The ones on which this 

method is based, and which are most prevalent for amides with half-times less than a 
few hundred days, depending on conditions and providing especially catalytic ions are 
not present, are most strongly effected by the structure to the right of the amide (e.g. in 
the sequence GlyXxx(Amide/Acid)YyyGly the identity of Yyy is the most important 
factor). Also present is at least one more mechanism that is usually slower and has 
different sequence dependence. It is possible that this dependence as well as the left 
hand structure dependence (Xxx in the sequence GlyXxx(Amide/Acid)YyyCly) can also be 
modeled with a similar system, but this has not yet been demonstrated. 



